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Abstract 


In this paper, we investigated whether the assimilation of remotely-sensed ehlorophyll data 
can improve the estimates of air-sea carbon dioxide fluxes (FCO 2 ). Using a global, established 
biogeochemical model (NASA Ocean Biogeochemical Model, NOBM) for the period 2003- 
2010, we found that the global FCO 2 values produced in the free-run and after assimilation were 
within -0.6 mol C m' y' of the observations. The effect of satellite chlorophyll assimilation was 
assessed in 12 major oceanographic regions. The region with the highest bias was the North 
Atlantic. Here the model underestimated the fluxes by 1.4 mol C m'^ y'^ whereas all the other 
regions were within 1 mol C m' y of the data. The FCO 2 values were not strongly impacted by 
the assimilation, and the uncertainty in FCO 2 was not decreased, despite the decrease in the 
uncertainty in chlorophyll concentration. Chlorophyll concentrations were within ~25% of the 
database in 7 out of the 12 regions, and the assimilation improved the chlorophyll concentration 
in the regions with the highest bias by 10-20%. These results suggest that the assimilation of 
chlorophyll data does not considerably improve FCO 2 estimates and that other components of the 
carbon cycle play a role that could further improve our FCO 2 estimates. 
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Introduction 

Every year, approximately 90 gigatons of earbon are exehanged between the atmosphere and 
the oeean (Falkowski et ah, 2000). Oeean sinks aeeount for -half of the anthropogenie CO 2 
emissions (Sabine et al., 2004). On longer time seales, deep-oeean export of carbon together 
with land-atmosphere exchanges plays an important role and is expected to be the dominant 
sink of anthropogenic carbon. In the subtropical ocean (~40°N and S), water loses heat to the 
atmosphere and consequently takes up CO 2 from the atmosphere (Takahashi et ah, 2009). 
Inversely to the effect of temperature on pC02, the phytoplankton community takes up CO 2 
and thrives in nutrient-rich (e.g. recently upwelled) water, thereby counteracting and in some 
cases overwhelming the effects of recently cold, C02-rich, upwelled water. The air-sea CO 2 
fluxes in high latitude oceans are governed primarily by deep convection in winter and 
biological uptake during spring and summer. At lower latitudes the warm water increases 
PCO 2 in the ocean and leads to a loss of CO 2 to the atmosphere. Process-based understanding 
of ocean carbon cycle is needed to predict future variations in the efficacy of the oceanic 
carbon sink. 

Takahashi et al. (2002) amassed -1 million surface pC02 observations and used these to 
derive air-sea CO 2 fluxes for a nominal year, namely, 1995 (Takahashi et ah, 2002). 
Uncertainties about these fluxes are directly linked to the spatial and temporal coverage of 
the measurements as well as the relationship between wind speeds on air-sea CO 2 gas 
transfer velocity. Others have used biogeochemical models of different level of complexity, 
whether local or global, to estimate the source and sinks of CO 2 . Gregg et al. (2013) used 
MERRA reanalysis data to force an ocean biogeochemical model (NASA Ocean 
Biogeochemical Model, or NOBM) and compared the fluxes obtained with those from public 
archives. The model produced global mean differences of 0.02% (-0.3 pM) for Dissolved 
Organic Carbon (DIC), -0.3% (-1.2 patm) for pC02 and -2.3% (-0.03 mol C m'^ y'^) for 
FCO 2 compared to in situ estimates. Basin-scale distributions of all three variables were 
significantly correlated with observations. They found that the regions with the largest 
differences between the carbon dioxide fluxes from the model and the observations were in 
the South Atlantic, South Pacific, North Indian, Equatorial Atlantic and North Pacific. 
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Despite some considerable basin-scale and local departures, they found that all basins were 
represented as sources to the atmosphere or sinks in agreement with in situ estimates. 

Data assimilation techniques can be used to constrain the model to track observation time 
series and to optimize certain variables. The assimilation of satellite chlorophyll is known to 
reduce spatial and temporal biases of satellite data. The assimilation of satellite chlorophyll 
produces estimates within 0.1% bias and 33.4% uncertainty as compared to in situ data 
(Gregg, 2008) thereby decreasing the bias of satellite data (-1.3%) while keeping the same 
level of uncertainty (32.7%, Gregg, 2008). Using twin experiments, Tjiputra et al. (2007) 
assimilated SeaWiFS chlorophyll in an adjoint model and found that the model-data misfit 
was reduced and that the posteriori run produced realistic carbon flux prediction. Similarly, 
using 1-D twin experiments for two contrasting locations in the North Atlantic, Hemmings 
et al. (2008) found that the assimilation of ocean color improved surface pC02. Here we 
investigate whether the assimilation of satellite-derived chlorophyll a data can improve 
carbon dioxide fluxes estimates using a global biogeochemical model that assimilates 
satellite-derived chlorophyll. We assess the effects of chlorophyll data assimilation on carbon 
dioxide fluxes globally and in 12 major oceanographic regions (Figure 1) for the period from 
2003 until 2010. 
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Figure 1: Delineation of the 12 major oceanographic basins. 

2. Methods 

Global ocean carbon dynamics were simulated using the NASA Ocean Biogeochemical 
Model (NOBM) for the period 2003-2010. NOBM is a three-dimensional representation of 
circulation/biogeochemical/radiative processes in a domain that spans from -84° to 72° at a 1.25° 
resolution in water deeper than 200 m. NOBM is coupled with the Poseidon ocean general 
circulation model, which is driven by wind stress, sea surface temperature, and shortwave 
radiation (Gregg and Casey, 2007). The model includes 3 detrital pools (silica, N/C and iron 
detritus), 4 phytoplankton groups (diatoms, coccolithophores, chlorophytes and cyanobacteria) 
and 4 nutrients (ammonium, nitrate, iron and silieate). The phytoplankton groups differ in their 
maximum growth rates, sinking rates, nutrient requirements, and optical properties. 

Carbon cycling involves dissolved organic carbon (DOC) and dissolved inorganic carbon 
(DIC). DOC has sources from phytoplankton, herbivores, and carbon detritus, and a sink to DIC. 
Die has sources from phytoplankton, herbivores, carbon detritus, and DOC, and it can be 
exchanged with the atmosphere, which serves as either a source or a sink. The ecosystem sink 
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for Die is phytoplankton, through photosynthesis. A complete description of the model can be 
found in Gregg et al. (2013). The main output of interest in this effort is the flux of CO 2 (FCO 2 , 
notation following Arrigo et ah, 2010; Doney et al., 2009), representing the exchange of carbon 
between the atmosphere and ocean. Positive is defined upward, indicating a source to the 
atmosphere. The fluxes from the model are compared to a data set obtained from the Lamont- 
Doherty Earth Observatory (LDEO) 

(http://cdiac.ornl.gov/oceans/LDEO_Underway_Database/index.html; Takahashi et ah, 2009). 
These in situ LDEO data use pC02 and estimate fluxes from wind speed reanalysis data and 
climatological nutrients from the National Oceanographic Data Center (Takahashi et ah, 2009). 

The model is initialized with DIC from GLODAP (GLobal Ocean Data Analysis Project) and 
is spun up with climatological forcing data from MERRA and atmospheric pC02 data from 
NOAA ESRL for the year 2000 (368.6 patm). The model is spun up for 200 years to remove 
residual trends from the model and produce a steady state (Gregg et ah, 2013). It is then run 
from 2003 to 2010 using time -varying forcing data from MERRA and atmospheric pC02 for 
each year from NOAA ESRL. A concurrent run utilizes MODIS Aqua chlorophyll data in 
assimilation mode (Gregg, 2008). Multi-variate nutrient adjustments corresponding to the 
chlorophyll assimilation (Rousseaux and Gregg, 2012) are also included in this latter run. The 
fluxes derived from a free-running simulation (hereafter referred to as free-run) model are 
compared to those from the run with MODIS Aqua chlorophyll data assimilation and to the 
estimates from Takahashi et al. (2009). The chlorophyll concentrations from both the free-run 
and after assimilation are compared to the Level 3 MODIS Aqua chlorophyll data. 


3. Results 

3.1 Chlorophyll Comparisons with Data 

The correlation coefficients between satellite-derived chlorophyll and chlorophyll from the 
model increased considerably after assimilation. The correlation coefficients for the free-run 
ranged over the years from 0.54 to 0.69, whereas those for the assimilation run ranged from 0.84 
to 0.96 (Figure 2). The correlation between satellite chlorophyll concentration and chlorophyll 
concentration from the model (from either run) was significant for all years between 2003 and 
2010 . 
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Chlorophyll concentrations were within --25% of the satellite chlorophyll in seven of the 12 
regions (Figure 3). The regions with the highest biases (>30%, both in free-run and after 
assimilation) were the North Indian, North Pacific and North Atlantic. In those regions, where 
the bias was high, the assimilation marginally improved the chlorophyll concentration (by 10- 
20%, Figure 4). Note that these regions also correspond to where the highest chlorophyll 
concentration was encountered (-0.55 pg chi L'^). 


Ocean Basin Correlation (r) Chi a 



2003 2004 2005 2006 2007 2008 2009 2010 


Figure 2: Ocean Basin Correlation between the satellite-based estimates of chlorophyll and both the free-run and the 
assimilation values over the different ocean basins. (*) indicates statistical significance (p < 0.05), NS = not 
significant. 
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Figure 3: Chlorophyll concentrations for the free-run, the assimilation run, and the satellite database, globally and in 
the twelve major oceanographic regions. 
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Figure 4: Map of Chlorophyll concentration (pg chi L'*) in June 2005 for (a) MODIS Aqua, (b) Free-run and (c) 
after assimilation of the MODIS Aqua satellite data. 
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3.2 FC02 Comparisons with Data 

Globally and regionally, we found that the fluxes were not greatly impaeted by the 
assimilation of satellite chlorophyll (Figure 5 and Figure 6). In nine out of the twelve regions, the 
difference between the fluxes from the free-run and those after assimilation was less than 0.1 mol 
Cm' y' . The three regions where these differences were greater than 0.1 mol C m' y' were the 
North Central Atlantic, South Pacific and South Indian. In the southern basins the assimilation 
run produced the more biased flux estimates, though the increase in bias was small (Figure 5 and 
Figure 6). 

2 1 

Globally the fluxes produced by the free-run and after assimilation were -0.25 mol Cm' y' , 

which represents an underestimate of FCO 2 by -0.6 mol C m' y' (Figure 5). The region with the 

greatest difference between the data and modeled FCO 2 , for both runs, was the North Atlantic, 

where the model underestimated the fluxes by 1.4 mol Cm' y' . All the other regions were 
2 1 

within -1 mol C m' y' of the data. In the North Atlantic, the assimilation improved the FCO 2 by 
a marginal 0.04 mol Cm' y' . In general the fluxes produced by the model were weaker than 
the in situ estimates (Figure 5). This was especially true in the regions north and south of 10°. 

The regions that served as the highest sinks and sources of FCO 2 were the same before and 

after assimilation. For both runs, the region with the highest FCO 2 sink was the North Atlantic 

( — 0.7 mol C m'^ y'^) followed by the South Atlantic ( — 0.6 mol C m'^ y''). Of the twelve major 

regions, only two (Equatorial Indian and Equatorial Pacific) were a source of CO 2 (PCO 2 of 0.05 

and 0.26 mol C m' y' ). The Equatorial Pacific also became a source of CO 2 after assimilation 
2 1 

(0.24 mol C m' y' ), but the PCO 2 in the Equatorial Indian became slightly negative after the 
assimilation (-0.003 mol C m' y' ). The Equatorial Atlantic and North Indian were the only two 
regions for which the fluxes from the database were positive but the ECO 2 produced by the 
model in both the free-run and assimilation were negative. 

The assimilation of chlorophyll did not decrease the uncertainty in ECO 2 (Eigure 7). There 
was a significant correlation between modeled FCO 2 and the fluxes from the database for six out 
of the eight years analyzed, namely, 2003-2008. Correlation coefficients for these six years were 
a minimum of 0.72 for the free-run and a minimum of 0.76 for the assimilation run. The 
correlation between the data and the model at regional levels was highly dependent on the 
number of observations available. In 2009 and 2010, when the number of months of available 
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data dropped to 7 and 4, the correlation was no longer significant. Out of the six years for which 
significant correlations were found, the correlation coefficients for the assimilation run were only 
slightly improved over those in the free-run for three years. 
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Figure 5: Air-Sea fluxes (mol C m'^ y'*) for the free-run, the assimilation run, and the database globally and in the 
twelve major oeeanographie regions. 
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Figure 6: Map of FCO 2 (mol C m'^ y ') from (a) the LDEO database, (b) the free-run and (c) after assimilation of 
satellite chlorophyll for June 2005. 
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Figure 7: Correlation between the satellite database of FCO 2 and the FCO 2 values produced in the free-run and after 
assimilation. (*) indicates statistical significance (p < 0.05), NS = not significant. 


4. Discussion 

The assimilation of satellite chlorophyll in NOBM was shown to decrease the uncertainties in 
chlorophyll concentration but to have little impact on simulated FCO 2 . The regions where the 
highest biases in FCO 2 were found (North Indian, North Pacific and North Atlantic) were not 
improved by the assimilation. Note that these three regions also show the greatest bias in 
chlorophyll concentration. Monthly maps of chlorophyll concentration (not shown) in these three 
regions highlight the fact that the satellite-based estimates there are particularly high during 
specific times of the year. For example, in the North Pacific and North Atlantic, the satellite 
chlorophyll is considerably higher than the model during the spring and summer. Clouds are one 
of the largest sources of sampling error, especially in the North Pacific (Gregg and Casey, 2007). 
Here, even in the middle of the growing season, there can be <5 days of observations per month. 
The presence of these clouds can lead to overestimate in chlorophyll concentration that are 
driven by period of high growth that occur during limiting periods of clear skies suitable for 
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remote sensing. Gregg and Casey (2007) also reported biases due to elouds in the North 
Atlantie, although they noted that the overestimates tend to be smaller than in the North Paeifie. 
The third region where bias is eonsiderable is the North Indian. In this region, aerosols are 
known to eontribute to sampling error (Gregg and Casey 2007). Aerosol obseurations inerease 
before and during the Monsoon season, at the same time as does ehlorophyll. In this study, we 
found that the satellite ehlorophyll values were considerably higher than those produced by the 
model in either the free-run and assimilation only during the northeast monsoon (between 
November and February). Other sources of bias include sun glint, inter-orbit gap, and sensor tilt 
changes, all of which occur primarily in mid-latitudes (Gregg & Casey 2007). 

The correlations between the FCO 2 values estimated from in situ measurements and those 
produced by the model in either the free-run model/assimilation were significant for the first six 
years of the study (2003-2008). The high correlation coefficients indicate a reasonably low 
uncertainty in our model-derived FCO 2 . The correlation for FCO 2 was again not significant for 
2009 and 2010 due to a lack of data in those years. No clear improvements in either correlation 
or bias resulted from the assimilation of satellite chlorophyll. 

The North Atlantic was the region with the greatest difference between the observations-based 
data and the modeled FCO 2 (from either run). Although the biased sampling in satellite 
chlorophyll is likely to have played a role here, the limited effect the assimilation had on FCO 2 in 
other regions indicates that other processes may be responsible. Chlorophyll is only one of the 
several components in the ocean carbon system, and therefore modest changes from the 
assimilation are not unrealistic. Deficiencies in carbon cycling processes in the model are not 
ameliorated by the improved chlorophyll abundances and distributions associated with the 
satellite chlorophyll assimilation. In fact, the simulated fluxes can actually get worse through 
assimilation (e.g., in the southern basins, as shown in Figure 5). This is likely the result of 
compensating deficiencies in the free-run model, where flaws in the simulated biological 
components compensate flaws in the carbon components. The chlorophyll compensation 
corrects the estimates of chlorophyll but not the underlying biological processes, exposing the 
carbon cycling issues. It is not clear whether this issue is unique to our model formulation or 
whether it is, in fact, general. It is reassuring that the correlations between data estimates and 
models are significant when sufficient in situ data are available, but it is also disconcerting that 
the biases are large. 
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In conclusion, these results suggest that ocean carbon and biological systems are complex and 
that work is needed to improve their representations in models, independently and 
simultaneously. Data assimilation of chlorophyll is not a panacea for model deficiencies. The 
number of products that can be derived from satellite ocean color is increasing, and the 
assimilation of products other than chlorophyll may further improve the fluxes estimates. Finally, 
the regions with the highest biases are eomplex regions which further reinforce our need to 
improve, through combined satellite and modeling efforts, our understanding of the complex 
interactions underlying the global carbon cycle. Further sensitivity analyses are needed to detect 
the components of the ocean carbon system that, if modeled and quantified more accurately, can 
lead to improved FCO2 estimates both regionally and globally. 
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